De novo transcriptome analysis of Bagarius yarrelli (Siluriformes: Sisoridae) and the search for potential SSR markers using RNA-Seq
نویسندگان
چکیده
BACKGROUND The yellow sisorid catfish (Bagarius yarrelli) is a carnivorous freshwater fish that inhabits the Honghe River, Lanchangjiang River and Nujiang River of southern China and other Southeast Asian countries. However, the publicly available genomic data for B. yarrelli are limited. METHODOLOGY AND PRINCIPAL FINDINGS Illumina Solexa paired-end technology produced 1,706,456 raw reads from muscle, liver and caudal fin tissues of B. yarrelli. Nearly 5 Gb of data were acquired, and de novo assembly generated 14,607 unigenes, with an N50 of 2006 bp. A total of 9093 unigenes showed significant similarities to known proteins in public databases: 4477 and 6391 of B. yarrelli unigenes were mapped to the Gene Ontology (GO) and Clusters of Orthologous Groups (COG) databases, respectively. Moreover, 9635 unigenes were assigned to 242 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. In addition, 8568 microsatellites (simple sequence repeats, SSRs) were detected, and 31 pairs of polymorphic primers were characterized using wild populations of B. yarrelli from the Nujiang River, Yunnan Province, China. CONCLUSION/SIGNIFICANCE These sequences enrich the genomic resources for B. yarrelli and will benefit future investigations into the evolutionary and biological processes of this and related Bagarius species. The SSR markers developed in this study will facilitate construction of genetic maps, investigations of genetic structures and germplasm polymorphism assessments in B. yarrelli.
منابع مشابه
Clustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering thevast amount of whole transcriptome sequencing data, it seems necessary to develop analgorithm to assemble transcriptome data. In this study we propose an algorithm fortranscriptome assembly in the absence of a reference genome. First, the contiguous sequencesare generated using de Bruijn graph with d...
متن کاملLarge Scale Identification of SSR Molecular Markers in Ajowan (Trachyspermum ammi) Using RNA Sequencing
The medicinal plant, Trachyspermum ammi is a rich source of active pharmaceutical ingredients with pharmaceutics effects. Microsatellite markers play a key role in the genome and gene expression, especially in secondary metabolite biosynthesis in medicinal plants. For the first time, transcriptome sequencing of this herb medicine was carried out to identify the microsatellite markers of this sp...
متن کاملOsteology and myology of the cephalic region and pectoral girdle of Glyptothorax fukiensis (Rendahl, 1925), comparison with other sisorids, and comments on the synapomorphies of the Sisoridae (Teleostei: Siluriformes)
The cephalic and pectoral girdle structures of the sisorid Glyptothorax fukiensis (tribe Glyptothoracini) are described and compared with those of representatives of the other three sisorid tribes, namely Glyptosternon reticulatum (tribe Glyptosternini), Bagarius yarreli (tribe Bagariini) and Gagata cenia (tribe Sisorini), as well as with those of several other catfishes, as the foundation for ...
متن کاملI-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing
Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...
متن کاملMining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.
Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial num...
متن کامل